SYMMETRY REDUCTION OF OPTIMAL CONTROL SYSTEMS AND 

PRINCIPAL CONNECTIONS 
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Abstract. This paper explores the role of symmetries and reduction in nonlinear control and 
optimal control systems. The focus of the paper is to give a geometric framework of symmetry 
reduction of optimal control systems as well as to show how to obtain explicit expressions of the 
reduced system by exploiting the geometry. In particular, we show how to obtain a principal 
connection to be used in the reduction for various choices of symmetry groups, as opposed to 
assuming such a principal connection is given or choosing a particular symmetry group to simplify 
the setting. Our result synthesizes some previous works on symmetry reduction of nonlinear control 
and optimal control systems. Affine and kinematic optimal control systems are of particular interest: 
We explicitly work out the details for such systems and also show a few examples of symmetry 
reduction of kinematic optimal control problems. 



1. Introduction 

1.1. Background. Many control systems, particularly those arising from mechanical systems, have 
symmetries — often translational and rotational, and sometimes combinations of them. Such a 
symmetry is usually described as an invariance or equivariance under an action of a Lie group, and 
the system can be reduced to a lower- dimensional one or decoupled into subsystems by exploiting 
the symmetry. Nijmeijer and van der Schaft [36J and Grizzle and Marcus |14| formulated symmetries 
of nonlinear control systems from the differential-geometric point of view, and also showed how one 
can reduce a control system with symmetry to a quotient space. 

Likewise, optimal control systems also have such symmetries. Grizzle and Marcus |15| showed 
that, in relation to the work in |14j . one can decompose optimal feedback laws by exploiting 
the symmetries of control systems; van der Schaft |44j showed a method to analyze symmetries 
of optimal Hamiltonians without explicitly calculating them, while de Leon et al. [11] analyzed 
symmetries of vakonomic systems and applied their result to optimal control problems, Echeverria- 
Enriquez et al. [12] from the pre-symplectic point of view, and Blankenstein and van der Schaft [3] 
and Ibort et al. [16] using Dirac structures. 

Symmetry reduction of optimal control systems are desirable from a computational point of view 
as well. Given that solving optimal control problems usually involves iterative methods such as the 
shooting method (as opposed to solving a single initial value problem), reducing the system to a 
lower-dimensional one results in a considerable reduction of the computational cost. 

From a theoretical point of view, a certain class of optimal control problems has a rich geometric 
structure, and provides many interesting questions relating differential- geometric ideas with control- 
theoretic problems. Most notably, Montgomery [2!J] EH EU [32] [S3], following the work of Shapere 
and Wilczek [HUHT], explored optimal control of deformable bodies, such as the falling cat problem, 
from the differential-geometric point of view. In particular, principal bundles, along with principal 
connections on them defined by momentum maps, are identified as a natural geometric setting 
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for such problems. The same geometric setting applies to kinematic control of nonholonomic 
mechanical systems (see, e.g., Kelly and Murray [IB] . Murray et al. [35j Chapters 7 and 8], and 
Li and Canny [22]), where the principal connections are defined by the constraints instead of 
momentum maps. This geometric setting also gives rise to geometric phases and holonomy (see, 
e.g., Marsden et al. [25] and references therein), which have applications in motion generation of 
mechanical systems by shape change. 

1.2. Main Results and Comparison with Existing Literature. Figure [T] gives a schematic 
overview of the results in the paper and their relationships. We first characterize symmetries in 
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Figure 1 . Schematic overview of reduction of control and optimal control systems 
with references to corresponding sections in the paper. See also the outline in 
Section 11.31 



nonlinear control systems and use a principal connection to reduce such systems. We then discuss 
the associated symmetries in optimal control problem of such systems following Grizzle and Marcus 
[15] . and apply Hamiltonian reduction theory to the Pontryagin maximum principle (PMP) for 
optimal control systems with symmetries; the principal connection plays an important role here as 
well. In particular, we apply the Poisson reduction of Cendra et al. [9] to the Hamiltonian system 
given as a necessary condition for optimality by the Pontryagin maximum principle. The resulting 
Hamilton-Poincare equations give a reduced set of equations for optimality, and are naturally 
considered as a reduced maximum principle applied to the reduced control system. 

We note that Ibort et al. |16j study a similar problem in a more general setting with a slightly 
different focus: We assume that one can eliminate the control to obtain a Hamiltonian system 
on a cotangent bundle, whereas Ibort et al. [16 1 do not make the assumption and exploit Dirac 
structures to handle those cases where one cannot easily obtain a Hamiltonian system explicitly, 
an approach originally due to Blankenstein and van der Schaft [3j. Therefore, our problem setting 
and geometric framework are in fact a special case of those in |16j . At the expense of generality, 
however, we focus on the practical issue of obtaining an explicit expression for the reduced system. 
Specifically, our specialization leads us to a prescription to obtain a principal connection for a given 
optimal control system with various symmetries, as opposed to assuming, as in \1 6j . that it is given 
at the outset or deliberately choosing a particular symmetry group to simplify the geometric setting. 
In particular, for affine and kinematic optimal control systems, we may explicitly characterize the 
principal connection using the nonholonomic connection of Bloch et al. [5] . 

We also note that a reduced maximum principle (see Fig. [T] and Section 3.3) is discussed by 
Blankenstein and van der Schaft [3]; however their definition of symmetry of control systems is 
slightly more restrictive compared to that of [16] and the present paper (see Remark 2.2). Note also 
that, in their setup, it is shown that their reduced equations are simplified due to the transversality 
condition. However, this is not in general true for the case with fixed endpoints and thus the 
reduced equations become more complicated (see Remark 3.4). 
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The construction of principal connections developed here turns out to be a generalization of the 
mechanical connection used in the falling cat problem as well as those used in kinematic control of 
nonholonomic systems. In the falling cat problem, there is a natural choice of principal connection 
that arises from the problem setting, but the same construction of principal connection applies to 
kinematic control problems only by choosing certain symmetry groups to realize the same geometric 
setting; in other words, one does not have the freedom to choose the symmetry group to be used in 
the reduction. Our construction does not have such restriction and hence can be applied to a wider 
class of control systems with symmetries. 

As a result, we synthesize some previous works by showing how the basic settings of those works 
arise as special cases of our result; these include optimal control of deformable bodies mentioned 
above and also the Lie-Poisson reduction of optimal control systems on Lie groups of Krishnaprasad 

EU. 



1.3. Outline. We first define, in Section [2j symmetries in nonlinear control systems, and show 
reduction of such systems by the symmetries (see Fig. [I]). Section [3] first briefly discusses Poisson 
reduction of Cendra et al. [9] for Hamiltonian systems and then applies it to the Hamiltonian 
system defined by the maximum principle and obtain the Hamilton-Poincare equations for such 
systems. Section [4] addresses the issue of how one should choose the principal connection to be 
used in the reduction of optimal control systems. Section [5] gives various examples to show how the 
theory specializes to several previous works on the subject as well as to illustrate how the reduction 
decouples the optimal control system. 

2. Symmetry and Reduction of Nonlinear Control Systems 

2.1. Nonlinear Control Systems. Let M be a smooth manifold and tm : TM — > M be its 
tangent bundle; let E := M x M. d and see ir E : E — > M as a (trivial) vector bundl^J also let 
/ : E — > TM be a fiber-preserving smooth map, i.e., the diagram 

E ^TM 

M 

commutes. Then a nonlinear control system is defined by 

x = f(x,u). (1) 



2.2. Symmetry in Nonlinear Control Systems. Following van der Schaft [13] and Nijmeijer 
and van der Schaft [36] (see also Grizzle and Marcus [14J and van der Schaft [44J), we assume that 
the control system ([I]) has a symmetry in the following sense: Let G be a free and proper (left) Lie 
group action on M. We have <3? : G x M — > M or <3? 9 : M — > M for any g E G\ as a result we have 
the principal bundle 

7T : M -> M/G. 

The action <3? 9 gives rise to the tangent lift T<3? 9 : TM — > TM. Let us also assume that we have a 
linear representation of G on the control space M. d , i.e., we have a representation : G — > GL(d, R). 
Then we define an action of G on E = M x R d as follows: 



$> g :E^E; (x,u)^($g(x),o- g (u)) = (gx,gu), (2) 
where we introduced the shorthand notation gx := $ g (x) and gu := o~ g (u). 

Remark 2.1. In many examples, the representation ov.-j : G — > GL(d,M) turns out to be trivial. 



However, there are non-trivial cases as well: See Section 5.2 



^More generally, we may take a fiber bundle for E (see, e.g., Nijmeijer and van der Schaft [36] and references 
therein) . 
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We are now ready to define a symmetry for a nonlinear control system (see Ibort et al. |16| 
Definition 8] ) : We say that the nonlinear control system Q has a G-symmetry if the map 
TM is equivariant under the G-actions on E and TM defined above, i.e., 

T<S> g of = fo^ g , (3) 

or the diagram 

E f —^TM 

TM 



f 

commutes for any g E G. 

Remark 2.2. Note that this definition of symmetry is more general compared to that of Blankenstein 
and van der Schaft [3]. Their definition of symmetry of the control vector fields [31 Eq. (14)], i.e., 

[f(;u), QM }=0, (4) 

where qm is the set of infinitesimal generators, is rather restrictive for us, since this is not true 
in general if f(-,u) has vertical components and G is non-Abelian. To illustrate it, consider the 
extreme case where M = G, a non-abelian Lie group. Then the system is a left-invariant control 
system on G (see Section 5.1); but then qm = TG and f(x,u) E TG and so Eq. Q does not hold 
except for the very special case where f(-,u) commutes with every vector field on G. 

2.3. Symmetry in affine control systems. Consider an affine control system, i.e., Eq. ([!]) with 

d 

f(x,u) = X (x) + } j u i X i (x), (5) 
i=i 

where the control vector fields {Xi}f =1 are linearly independent on M. Let T> C TM be the 
distribution defined by 

V:=span{X l ,...,X d }. (6) 
We assume that the vector field Xq is G-invariant, i.e., for any g E G, 

T$ g oX = X o $ g , (7) 
and also that the distribution is invariant under the tangent lift of the G-action on Q, i.e., 

T<& g {V) = V (8) 

for any g E G. This implies that, for each vector field Xi for % = 1, . . . , d and any x E M and g E G, 
we have 

d 

T x %(Xi(x)) = Y J 4( x i9)X j (gx), (9) 
i=i 

where R(x, g) is an invertible d x d matrix. This gives rise to an action of G on E = M x M. d , i.e., 
fy g : E ->• E defined by 

^ g : (x, u) H> (gx, R T (x, g)u) . 
Then the G-symmetry of Xq and V, i.e., Eqs. ([7]) and Q, implies that of /, i.e., 

T x $ g (f(x,u)) =foV g (x,u). 

In particular, consider the case where R(x,g) has no dependence on x, i.e., R(x,g) = R(g); this 
is the case if, for example, M is a vector space and the action <& g : M — > M is linear. Then the 
matrix R T (g) gives the representation o~r.) : G — > GL(d,M), i.e., o g = R T (g). 
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2.4. Reduced Control System. The equivariance of the map / shown above gives rise to the 
map / : E/G —> TM/G defined so that the diagram 

E f —^TM 



TM 



E/G — js~ TM/G 

commutes, where tt g : E — >■ E/G and tt g m : TM — > TM/G are both quotient maps. Then the 
map / defines the reduced control system. 

Since E = M x ]R , the quotient E/G defines the associated bundle 

E/G = (M x R d )/G = M x G R d , 

which is a vector bundle over M/G (see, e.g., Cendra et al. [71 Section 2.3]). On the other hand, 
again following Cendra et al. [7, Section 2.3], the quotient TM/G is identified with T(M/G) © 0, 
where is the associated bundle defined as 

q:=Mx g q = (Mxq)/G 

with g being the Lie algebra of the Lie group G. More specifically, given a principal bundle 
connection form 

A:TM -»• g, (10) 
we have the identification (see [TJ Section 2.4]) 

a A :TM/G^T(M/G)®Q- [v x ] G ^ T x n(v x ) 8 [x,A x (v x )] G , (11) 

where [-] G stands for an equivalence class defined by the G-action. Therefore, we may introduce 
the maps f M/G : E/G -> T(M/G) and j~ : E/G -> defined by 

Jm/g{[ x Mg) ■= T x Trof(x,u), fg([x,u} G ) := [x,A x {f(x,u))} G 

for any element [x,«]g E E/G = M x G M. d ; these maps are clearly well-defined because of the 
equivariance of /. Then we have 

ola°] = /m/G©/§, 
and thus the reduced system is decoupled into two subsystems: 

x = Jm/g{u>x), lx = h{u x ), (12) 

where x := tt(x), u x := [x,u] G , and ^ := [x,A x (x)] G . 

3. Symmetry and Reduction of Optimal Control Systems 

This section first summarizes the fact that the G-symmetry of a nonlinear control system implies 
that of the corresponding optimal control system if the cost function is also G-invariant. We note 
that similar results are briefly discussed in Grizzle and Marcus |15j . We then show how a Poisson 
reduction may be applied to reduce the optimal control system with symmetry. 

3.1. Pontryagin Maximum Principle and Symmetry in Optimal Control. Given a cost 
function G : E — > M and fixed times to an d t\ such that to < t\, define the cost functional 

J ■- [ 1 C(x(t),u(t))dt. 

Jto 

Let xq and x\ be fixed in M. Then we formulate an optimal control problem as follows: Minimize 
the cost functional, i.e., 

rti 

minJ = min / C(x(t),u(t)) alt, 
«(•) «(•) Jt 
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subject to Eq. Q, i.e., x = f(x,u), and the endpoint constraints x(to) = xo and x{t\) = x\. 

A Hamiltonian structure comes into play with the introduction of the augmented cost functional: 
Let us introduce the costate X(t) G T*M and define 

S := / 1 [C(x(t),u(t)) + (A(t), x(t) - f(x(t), u(t)))} dt 
Jto 

Hi 



(X(t),±(t))-H(x(t),X(t),u(t)) 



to 



dt 



with the control Hamiltonian: 

H : T*M (BE^R; H{X X , u x ) = H(x, X, u) := (X x , f{u x )) - C(u x ), 

where we wrote A^ := (x, A) G T*M and u x := (x, u) G E x (recall that E = M x R d is a trivial 
vector bundle over M). If the cost function is invariant under the G-action ^ defined in Eq. Q, 
i.e., for any g £ G, 

Co^ g = C, (13) 

then the control Hamiltonian H has a symmetry in the following sense: Define an action of G on 
the bundle T*M © E by, for any g G G, 

^ g : T*M © £ -> T*M © E; (X x , u x ) ^ (T* ( A -), . 
where T*$ ff -i : T*M — >■ T*M is the cotangent lift of <& 5 . Then it is easy to show that the control 
Hamiltonian H is invariant under the G-action defined above, i.e., 

Hoi>g = H (14) 

for any g £ G. 

Now, for an arbitrary fixed X x G T*M, define ¥ C H(X X , ■ ) : — > E* as follows: For any G -Eo,, 

¥ c H(X x ,u x ),w x ) = -^-H(X x ,u x + ew x ) , 

1 d£ 6=0 

where ( • , • ) on the left-hand side is the natural pairing between elements in E* and E x . We assume 
that the optimal control u* : T*M — > E x = M rf is uniquely determined by the equation 

¥ c H(X x ,u x {X x )) = 

for any A x G T*M. This gives rise to the fiber-preserving bundle map 

u*:T*M^E- X x ^u x (X x ). 

Then one may show that the optimal control u* : T*M — > E is equivariant under the G-actions, 
i.e., 

f s ou* = u*oT\i, (15) 
and so we may define the optimal Hamiltonian H : T*M — >■ M by i? := if o u*, or more explicitly, 
£T(A,) := H{X x ,u x (X x )) = (X x , f(u*(X x ))) - C(u x (X x )). (16) 



Then the symmetry of the control Hamiltonian and the optimal control, i.e., Eq. (14) and (15), 
imply that of the optimal Hamiltonian H, i.e., 



HoT*$ g -i = H 



for any g G G. 

The Pontryagin maximum principle says that the optimal flow on M of the control system ([!]) is 
necessarily the projection to M of the Hamiltonian flow on T*M with the optimal Hamiltonian H 
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defined above. Specifically, let f2 be the standard symplectic form on T*M, ttm ■ T*M — > M the 
cotangent bundle projection, and Xh the Hamiltonian vector field defined by 

i Xn n = dH; (17) 

then there exists a solution A : [to, t±] —> T*M of the above Hamiltonian system with 7TM(A(to)) = x o 
and 7TAf(A(ii)) = x i such that its projection to M, -km ° A : [io^i] — > M, is the optimal trajectory 
of the control system (see, e.g., Agrachev and Sachkov [21 Chapter 12] for more details). In other 
words, the optimal flow on M of the control system is given by the vector field Tttm(Xh) on M. 



3.2. Poisson Reduction and Hamilton Poincare Equations. We saw that the optimal Hamil- 
tonian H is G-invariant; this implies that we can apply the results of symmetry reduction of 



Hamiltonian systems to Eq. (17) to obtain a reduced Hamiltonian system related to the optimal 



flow. Such reduction is helpful in practical applications, since it helps one to reduce the number of 



unknowns in the Hamiltonian system (17) 



Reduction of Hamiltonian systems is a well-developed subject, whose roots go back to the sym- 
plectic reduction of Marsden and Weinstein [24J; there have been substantial subsequent devel- 
opments (see Marsden et al. [26J and references therein). In our case, the Poisson version of the 
cotangent bundle reduction (see Cendra et al. [9] and Marsden et al. [261 Section 2.3]; see also 
Montgomery et al. [33] and Montgomery [28]) turns out to be a natural choice for the following 



reason: Recall that we derived the reduced control system (12) on the quotient configuration space 
M/G using the bundle T(M/G)ffig over M/G. It is natural to expect and also is desirable that the 
maximum principle, originally formulated on T*M, reduces to the dual T*(M/G) © q* , which is 
also a bundle over M/G; then we may consider it a reduced version of the maximum principle (see 
the dashed arrow in Fig. [7p. The Poisson version of the cotangent bundle reduction works precisely 
this way: The Poisson structure on T*M reduces to that on T*M/G = T*(M/G) ffig*; accordingly, 
Hamilton's equations reduce to the Hamilton-Poincare equations [9J defined of T*{M/G) © q* . 

As shown in Marsden et al. [M Lemma 2.3.3 on p. 74], the identification of T*M with T*(M/G)® 
q* is provided by the dual of the inverse of a a defined in Eq. ([TT]) : 



(a^)* : T*M/G -> T*(M/G) © g*l [AJg H- hl*^) © [x,J(X x )] G , (18) 

where hi* : T*M -> T|(M/G) is the adjoint of the horizontal lift hl x : T S (M/G) -> T X M associated 
with the connection form A : TM — > q, and J : T*M — > g* is the momentum map corresponding 
to the G-symmetry: Let £ be an arbitrary element in jj and € 3C(M) its infinitesimal generator; 
then J is defined by 

(J(\ x ),0 = (\ x ,Z M (x)). (19) 

Recall from, e.g., Marsden and Ratiu [23l Section 11.4] that Noether's theorem says that a G- 
invariance of H implies that J is conserved along the flow of the Hamiltonian vector field Xh- We 
note that Sussmann [42] formulated a generalized version of Noether's theorem for optimal control 
systems that does not require some of the assumptions we made here; however the original one 
suffices for our purpose here. 

Cendra et al. [9] exploit this identification to reduce the Hamiltonian dynamics with a G-invariant 
Hamiltonian H : T*M — > R as follows: The G-invariance implies that one can define the reduced 



Hamiltonian on T*M/G, which is identified with T*(M/G) © g* by Eq. ( |18[ ), i.e., one has H 

h the reduction of Hamilton's 

S [ tl [(p,q}-H(q,p)]dt = 

Jtn 



T*(M/G) © q* — > M. Then, through the reduction of Hamilton's phase space principle, i.e., 

'to 
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with 5q(to) = 6q(t\) = 0, one obtains the Hamilton-Poincare equations defined on T*(M/G) © q*: 

&- dS £- dB 

OX ' djl 5 

(20) 

where A © p, is an element in T*(M/G) g*; D/Dt is the covariant derivative in the associated 
bundle (see Cendra et al. Section 2.3] and Cendra et al. B is the reduced curvature form 
defined as follows (see Cendra et al. [91 Lemma 4.5]): Let hor^ : T X M — > T X M be the horizontal 
component defined by the connection form A: 

hoi x (X x ) = X x - (A x (X x )) M (x), 

where ( • )m : — > 3t(Af) is the infinitesimal generator. Also, let B be the curvature of the connection 
form A : TM — > g, i.e., it is the q- valued two- form on M defined by 

B x (Xx,y x ) = dA x (hor x (x x ),hoTx(yx)). 

Then the reduced curvature form B is the g-valued two-form on M/G defined by 

B X {X X ,Y X ) = [x,B x (X x ,y x )] G 

for any X X ,Y X E T X (M/G) and X x ,y x E T X M such that T x ir(X x ) = X x and T x ir(y x ) = Y x . In 
coordinates (see Cendra et al. [HI Section 4] for details), Eq. (20) becomes 

= OB ~ 8H 

L = ~ (B}JP + AiC^) , Aa = (|g - ^ 

where £ a and /i a are the locked body angular velocity and its corresponding momentum (see Bloch 
et al. |5l Section 5.3]) defined by 

e = e + A a J a ={Ad g - 1 A ix<g) (^g)) a , p a ={Ad* g 3(X x )) a . 

with £ = TgLg-i(g); the coefficients .4.^ are defined in the coordinate expression for the connection 
form A as follows: 

A ix>g) (x,g) = Ad g (C + A a J a )e a , 

where {e a }^™ G is a basis for the Lie algebra q. Also the coefficients Bg a for the curvature are 
given by 

BA a 8A a 

3.3. Poisson Reduction of Pontryagin Maximum Principle. Let us apply the above Poisson 
reduction to the Hamiltonian system (17) defined by the maximum principle. First calculate 
the reduced optimal Hamiltonian H corresponding to the optimal Hamiltonian (16). Using the 
identification in Eq. (18) and also the reduced optimal control 

u* : T*M/G -»■ E/G, 



which is well-defined due to Eq. (15), we can rewrite the Hamiltonian H as follows: 
H(X X ) = ((a^)*(X x ), a A o f(u*(X x ))) - C(u x (X x )) 

[KM, ^/ g (Wg)) + ([x,J(X x )] G , f$([X x ] G )) - C*([X X ] G ) 
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where we defined the reduced cost function C 1 : i?/G-^lby Co ttq = C and also 

fhi/G : T*M/G — > T(M/G), /f :T*M/G g, C* : T*M/G — >■ K 

by 

C*(Mc) :=Co^([AJ G ). 
Define the reduced optimal Hamiltonian H : T*(M/G) © g* -> R by 

fi"(Axe/ix) := (^xJm/ G {Xx® fix)) + (As.jg (A*©/**)) ~ <?*(As© A?) > ( 21 ) 

where we identified T*M/G with T*(M/G) © g* as the domain of the maps f^fc anc ^ 
Then we have H(X X ) = H(X X © As) with 

A 2 := hl*(A x ), ft x := [x, J(A X )] G . 
In coordinates, the reduced optimal Hamiltonian is 

H(x,X,fi) = \ a fy a /G (x,\,fi) + fi a f^ a (x,X,fl) -C*(x,X,fi) . 

Applying the Hamilton-Poincare equations ( |20[ ) of Cendra et al. [SJ to this particular choice of 
H gives the following: 

Theorem 3.1. Suppose that the nonlinear control system and the cost function have G- 
symmetries in the sense of Eqs. ^ and (13). Then the necessary condition of the Pontryagin 
maximum principle reduces to the following set of equations: 

% = /m/g(^ 5 © &%) ' £ = fgV^n ® ' 

DX dH / ~\ Djl (22) 

or, in coordinates, 

^ = /m/g(^> A , f 1 ) > £° = /§' a ^' A ' fi) ' 

(23) 



<9# 



A Q = - Aa(^ + ^Q,/f A, A)) , = /i & Ci(f d (x,A,A) 

Remark 3.2. Notice that the equations for (x,X,fi) are decoupled from the second one. Thus one 
first solves this subsystem and then solve the second equation to reconstruct the dynamics in the 
group variables. 

Remark 3.3. If the Lie group G is Abelian, then the structure constants C£ c vanish, and thus we 
have 

^ a = ftf/G A ' ' £ a = £ ,a > A ' A) > 

- (24) 

A Q = - JjJB^Jfi, fia = 0. 

In particular, the last equation gives a conservation of the momentum map J, which simplifies the 
set of equations further. In the non-Abelian case, the conservation of J is "hidden" in the last 
equation of ( 23 ) since the new variable fi is not J itself (which is conserved) : Recall that we defined 
pL a = (Ad*J(Aa;)) , which reduces to fi a = 3(X x ) a in the Abelian case. Notice also that, after 
solving for (x,X), the second equation (24) is solved by quadrature: The equation reduces to the 
form g^i^git) = ((t), where ((t) is a known curve in the Lie algebra g, and thus we can integrate 
the equation easily to obtain 



g{t) = ex P^ C(s)ds\ , 
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since G is Abelian and thus all the bracket terms in the iterated integrals coming from the Picard 
iteration vanish (see, e.g., Iserles [T7]). 

Remark 3.4. The reduction of Blankenstein and van der Schaft |3j takes advantage of the fact that 
the momentum map J vanishes due to the transversality condition. In our setting, the endpoints 
are fixed and thus this does not hold in general; hence the equations cannot be reduced to the 
cotangent bundle T*(M/G) as discussed at the end of Section 3 of |3j. This is why our reduced 
equations are slightly more complicated than theirs. Note, however, that the vanishing of J implies 
jl = and thus our result simplifies to theirs, except of course the differences in our settings and 
formulations mentioned in Section 11.21 and Remark V 



4. How Do We Choose the Principal Connection? 
We have shown that an optimal control system with symmetry may be reduced to the Hamilton- 



Poincare equations (22). However, we did not address the issue of how we should choose the prin- 



cipal connection form A introduced in (10). Whereas sometimes the problem setting provides a 
natural choice of principal connection, such as the mechanical connection in the falling cat prob- 
lem (see, e.g., Montgomery [29]), it is often not clear what choice has to be made. One may realize 
the same setting as the falling cat problem (the "purely kinematic" case discussed below in Exam- 



ple 4.1 ) by choosing some particular symmetry subgroup of a larger symmetry group of the system; 
however, this means that one is forced to make a particul ar ch oice of s ymmetry group even when 
a larger symmetry group is available. See, e.g., Examples 4.7 and 4.8 below: The choice G = M 2 



realizes the "purely kinematic" case but we have M? x 50(2) as a larger symmetry group. 

In this section, we show a construction of principal construction that does not impose such 
constraints on the choice of the symmetry group G. This construction is particularly explicit for 



affine and kinematic control systems (Sections 4.2 and 4.4), but may as well be formulated for more 



general settings under certain assumptions (Section 4.3). 

4.1. Principal Connection. Let 0(x) be the orbit of the G-action $ on M (defined in Section 2.2) 
through x G M, and V x be its tangent space at x, i.e., 

O(x) :={<S> g (x) G M | 5 G G} , V x :=T x O(x). 

Then a principal connection % on the principal bundle ir : Q — > Q/G is given by a G-invariant 
distribution that complements V, i.e., 

T$ g (U) = fi for V 5 G G and T X M = % x © V x for \/x G M. 

Then one may find the corresponding principal connection form A : TM — > q such that A x {£m(%)) = 
£ for any £ G g and kerAj, = 7i x for any x G M. 

4.2. Nonholonomic Connection. One example of principal connection is the so-called nonholo- 
nomic connection introduced in Bloch et al. [5J Section 6.4] (see also Cendra et al. [HI Section 3]) 



for reduction of nonholonomic mechanical systems. As we shall see in Section 4.4, the nonholo- 



nomic connection — complemented by the results of Section 4.3 — turns out to be a natural choice 
of principal connection for affine optimal control systems. 
First we make the following "dimension assumption" [5]: 

T X M = V X + V X , 

where we recall that T> is the distribution defined by the control vector fields (see Eq. Q). Now 
let (see Fig. ^ 

S x := V x n V x . 

Then one may choose, exploiting an additional geometric structure, a certain complementary sub- 
space % x of S x in T> x to write T> x as the direct sum of them: 

V X =U X ®S X . 
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One may also introduce a complementary subspace IA X to S x in V x as well: 

V x = S X @U X . 

As a result, we have the following decomposition of the tangent space T X M: 

T X M = H X (BV X = H X ®S X ® U x . 

If, in addition, % is G-invariant, i.e., T^ g (%) = H, then it defines a principal connection on the 
principal bundle ir : M — > M/G; it is called the nonholonomic connection \5\. Note, however, that 
the choice of H is not unique without some additional structure. We will come back to this issue 
later in the subsection to follow. 




7 



Figure 2. Nonholonomic connection (5j [8]. T> x is spanned by the control vector 
fields {Xi}f =l ; V x is the tangent space to the group orbit through x £ M; T~L X defines 
a principal connection. 

Using the nonholonomic connection, the reduced control system ( |12[ ) can be written as 

d d 
x = X (x) + } j UjXi(x), £ x = [x, A x ■ X (x)] G + } ] Uj [x, A x ■ Xj{x)] G , (25) 

i=l i=l 

where X{ := Tir(Xi) for i = 0, 1, . . . , d. 

The following special case, often called the "purely kinematic" case [5], gives a simple (although 
somewhat trivial) example of nonholonomic connection: 

Example 4.1 (Purely kinematic case — Control of deformable bodies and robotic locomotion). Con- 
sider the special case where the tangent space to the group orbit V x = T x O{x) exactly complements 
the G-invariant distribution T> x , i.e., S x = and thus 

T X M = V X ®V X . 

This is the special case called "purely kinematic" case or "Chaplygin systems" in the context of 
nonholonomic mechanics [S|. In this case, T> x itself gives the horizontal space T~L X and thus defines 
the connection form A : TM — > g such that ker A x = T> x (recall the G-symmetry of T>, i.e., Eq. Q). 
As a result, Eq. (25) becomes 

d 

x = X (x) +y^UiXi(x), i x = [x, Ax(X (x))] G . 
i=i 

In particular, for the drift-free case, i.e., Xo(x) = 0, we have A x {Xq(x)) = and so ^ x = 
[x, A x {x)]g = 0, which implies A x {x) = 0. With local coordinates (x,g) for M, we may express the 
connection A as 

A( x>9 ) (x, g) = kd g (g~ l g + A{x) x) , 
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where we slightly abused the notation to use A(x) as a coordinate expression for the connection 



form A[x, g )- As a result, Eq. (25) becomes 

d 

x = UiXi(x), g~ x g = -A(x)x. 
i=i 

This is the basic setting for control of deformable bodies (see, e.g., Montgomery (31]) and also of 
robotic locomotion (see, e.g., Li and Canny |22j . Kelly and Murray [18J, and Murray et al. [351 
Chapters 7 and 8]); for the former, the connection form A is defined by the mechanical connection 
(see, e.g., Marsden et al. [26j Section 2.1]) whereas for the latter it is defined by the distribution V 
arising from the nonholonomic constraints. 

However, in general, S x = T> x n V x 7^ 0, and so the choice of the subspace H x is not trivial, and 
thus we need to resort to additional ingredients to specify %; this is the topic of the next subsection. 

4.3. Momentum Map and Principal Connection. To get around the above-mentioned diffi- 
culty in specifying the principal connection T~L, we propose a way to exploit the momentum map 
(see Eq. (fl9|)) corresponding to the symmetry group. Specifically, we give a generalization of the 
mechanical connection (see, e.g., Marsden et al. |26l Section 2.1]) for systems with degenerate 
Hamiltonians. 



The main result in this subsection, Proposition 4.2 does not assume the affine optimal control 
setting, but is proved under quite strong assumptions. In Section 44 below, we show that these 
assumptions are automatically satisfied for a certain class of affine optimal control systems, and also 
that the construction of principal connection developed here gives a unique choice of the horizontal 
space H. 



Let J : T*M — > g* be the momentum map for the Hamiltonian system (17) associated with the 
optimal control of the nonlinear control system ([I]). Recall (see, e.g., Marsden et al. [261 Section 2.1]) 
that the mechanical connection form A : TM — > q is defined by 

i = r 1 ojoK, 

with the locked inertia tensor I : g — > q* and a Lagrangian L : TM — > R; FL : TM — > T*M is the 
Legendre transformation defined by 

(¥L{v x ),w x ) = ^-L(v x + ew x ) 

d£ 6 = 

for any v x ,w x G T X M. This definition does not directly apply to our setting, since there is usually 
no such Lagrangian L in the optimal control setting. 

Therefore, we need to generalize the notion of the mechanical connection here: Let $:GxM-> 
M be a free and proper action of a Lie group G, and H : T*M — >• R be a Hamiltonian. Define 
WH : T*M -> TM by 

(WH(a x ),f3 x ) = ^H(a x + ep x ) 

d£ 6=0 

for any a x , f3 x £ T*M. We assume that FH :T*M -> TM is linear and thus U C TM defined by 

U :=¥H{J- 1 (0)) 

gives a distribution on M. Then, under certain assumptions, H gives a principal connection on 
7T : M -> M/G: 

Proposition 4.2. Let V x = T x O(x) be the tangent space to the group orbit O of the action <£. 
Suppose that the Hamiltonian H is G -invariant, FH : T*M — > TM is a linear map that is non- 
degenerate on J _1 (0), and also that the intersection of V and % is trivial, i.e., 7i x n V x = for 
any x £ M. Then % defines a principal connection on ir : M — ^ M/G. 
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Proof. See Appendix |A} □ 



The G-invariance of the Hamiltonian is always satisfied in our setting as mentioned in 3.2 The 
other conditions are somewhat contrived, and it is not clear as to whether one may further scrutinize 
and weaken the conditions for general settings. However, in the next subsection, we show that the 
linearity of WH and H x D V x = are automatically satisfied for a certain class of affine optimal 
control problems. 

4.4. Application to Affine Optimal Control Systems with Quadratic Cost Functions. 

We apply Proposition 4.2 to a certain class of affine optimal control problems and show that 
Proposition |4.2| helps us identify the unique principal connection Ti even in the non-purely kinematic 
case. 

Consider the following affine optimal control problem: 

d 1 

x = X (x) + ^2 UiXi(x), C(x, u) = -gijUiUj, (26) 

i=\ 

where g^ '■= g(Xi,Xj) for 1 < i,j < d with a G-invariant sub-Riemannian metric g on M that is 
positive-definite on the distribution T> := span{Xi, . . . , AT^}. 

Let us first introduce a couple of notions to be used in the discussion to follow: 

Definition 4.3. The drift-free control Hamiltonian H d f '■ T*M © E — > M. for the affine control 
system ( 26 ) is defined by 

d 1 
H d i(X x ,u x ) ■= (X x ,Xi(x)) - -gijUiUj. 

8=1 

Setting F c Hd{ = F c /f = gives the optimal control 

u* j (\ x ) = (\ x ,X j (x)), (27) 

and thus we may define the drift-free optimal Hamiltonian H^t '■ T*M — )• M by 

1 

T 

For kinematic control systems, i.e., Xq(x) = 0, we have -f^df = H and = H. 



H d f(X x ) := H di (X x ,u x (X x )) = -g ij (X x ,Xi(x)) {X x ,Xj{x)) . 



Remark 4.4. As we shall see below, the drift-free optimal Hamiltonian is used merely to define a 
map from J -1 (0) C T*M to TM. Note also that H d { is degenerate unless d = m := dimM, i.e., 
the system is fully actuated. 



Proposition 4.5. S uppo se that the affin e optimal control system (26) is G-invariant in the sense 
described in Example 2.3 and Section 3.1, and also that FiJ^f : T* M — > TM restricted to J -1 (0) is 
non- degenerate. Then the distribution 

U := ¥H di (J _1 (0)) C TM (28) 

defines a principal connection on ir : M — > M/G. 

Proof. Clearly, the G-invariance of the optimal control system implies that of H d f as well. Therefore, 
by Proposition 4.2, it remains to show T~L X n V x = 0. 

First notice that the Legendre transformation Fifjf : T*M — > TM is given by 

a x ^ ¥H di (a x ) := g ij {a x ,Xi(x)) X^x). (29) 

Let ^ be an element in q such that £m(%) is in H x . Then ^m(^) = ¥H d [(a x ) for some a x G J _1 (0), 
and thus, we have, using the definition of the momentum map J, 

(a x ,¥H d{ (a x )) = {a x ,£ M (x)) = {J{a x ),£) = 0. 
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On the other hand, 

(a x ,¥H df (a x )) = g ij (a x ,Xi(x)) (o^,Xj(x)) . 

Since g 13 is positive definite, we have (a x , Xj(x)) = for j = 1, . . . d and hence Cm(x) = ¥H^(a x ) 
0. Therefore, it follows that H x n V x = 0. 



□ 



Remark 4.6. It is clear from Eqs. (28) and (29) that T~L X is a subspace of V x . Since T X M = Tl x ®V x 



as well, the definition of % x coincides that of Section [42 

Let us first show the purely kinematic case: 

Example 4.7 (Snakeboard [5j El [201 [38] with M 2 -symmetry). We consider a kinematic optimal 
control problem of the snakeboard shown in Fig. KM The configuration space is M = SE{2) xS 1 x 




Figure 3. The Snakeboard. 

S 1 = {{x\,X2, 9, ip, (j))}. The velocity constraints are given by 

x\ + (r cos cot 4>) 8 = 0, ±2 + (r sin 8 cot 4>) 8 = 0, 
and thus we have V = span{Xi, X2, X3} with 

d . „ d tanc/> 5 „ , , 5 

00' 



Xi(x) 



cos v — h sin C/ 7: — 

axi 0x2 



X 2 (x) 



X 3 (x) 







r OV ' ' dip 

where x = (xi,X2, 8, ip,<j)). Therefore, we may consider the following kinematic control system: 

x = f(x,u) := uiX 1 (x) + u 2 X 2 (x) + u 3 X 3 (x), 

or more explicitly, 

tan d> 



xi = u±cos8, X2 = uisin8, 
We define the cost function C : SE(2) x R 3 

C(x, u) = 



1 



-ui- 



as follows: 



t 2 1 2 1 2\ 



U 3 . 



Then the above control system has an SE{2) x 5'0(2)-symmetry, where SE{2) acting on the SE{2) 
portion of M by left multiplication and SO{2) acting on the first S 1 in M, i.e., the variable ip. Here 
we choose the subgroup G = R 2 of SE(2) x 50(2) to show that it realizes the purely kinematic 



case (see Example 4.1). 
Let & : G x M 



Also let a : G x R 3 



M be the G-action on M, i.e., 

$ : ((a, 6), (x 1 ,x 2 , 8, ip, (j))) i-> (xi + a, x 2 + b, 8, ip, (p). 
-> R 3 be the trivial representation: 

a : ((a, 6), (ni,n 2 ,n 3 )) i-> (ui,u 2 ,u 3 ), 
which induces the action ^ : G x E ^ E defined by 

^ : ((a, 6, /?), (xi, x 2 , 8, ip, (j), ui,U2,u 3 )) (->■ (xi + a, x 2 + 6, 0, -0, 0, «i, n 2 , u 3 ). 
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The momentum map J : T*M — > T( 0j o)^ 2 — H^ 2 associated with the action of M 2 is 

J(a?i, x 2 , 0, ip, cf>, Ai, A 2 , Xe, \<j,) = (Ai, A2), 

and then 

H = FF(J- 1 (0)) =span{Xi,X 2) X 3 }. 
So % = T>, and thus this is a purely kinematic case. 

A different choice of symmetry group renders the problem non-purely kinematic. The follow- 
ing example illustrates it; the results here will be later used in the reduction of the system in 
Example |5.2| 



Example 4.8 (Snakeboard with M? x 50(2)-symmetry). Now we choose G = M. 2 x 5*0(2); this is 



an Abelian case (see Remark 3.3) that gives rise to a non-purely kinematic case. 



Let $ : G x M -> M be the G-action on M, i.e., 

$ : ((a, b, 0), (xi,x 2 , 6, ip, <j>)) i-> [x x + a, x 2 + 6, 0, tp + p, cp). 

Also let a : G x IR 3 — > M 3 be the trivial representation: 

a : ((a, b, (3), (m, u 2 , u 3 )) i-> (m, u 2 , u 3 ), 

which induces the action ^> : G x E ^ E defined by 

* : ((a, 6, P), (xi,x 2 , 0, tp, cj), m,u 2 ,u 3 )) i-> (21 + a, x 2 + 6, 0, ^ + P, (p, u 1} u 2 , u 3 ). 

Then it is straightforward to show that / and C satisfy the symmetry defined in Eqs. ^ and (13) 
respectively. 

J T . T^* 71 JT . n 

(0,0) 



The momentum map J : T*M -> T (0j0) IR 2 x so (2) = M 3 associated with the action of M 2 x SO (2) 



is 

J(xi,x 2 , 9, ip,(f>, Ai,A 2 , A0, A^,, A,/,) = (Ai, A 2 , A,/,), 

and so 

„ = FJT(J-'(0)) = S pan{c„ sS A t « A - ^* » , £} = span^X,} . 

Since 2? = span{A"i, X 2 , X 3 }, this is not a purely kinematic case, and S = span{A 3 }. The connec- 
tion form A : TM — > g is then given by 

A.f$ t <p) = (dx\ + r cos 6 cot <p d9) ® ei + (<ix 2 + r sin 6 cot d0) ® e 2 + dV> ® , (30) 

where {ei, e 2 , e^,} is a basis for the Lie algebra T^q^M. 2 x so (2) — K 3 - We then identify the vertical 
space U as follows: 



U = span 



f g 9 _0\ 



The reduced curvature form i3 is then 

= r cos $ csc;2 <pd9 A dip ® e\ + r sin esc 2 (pd6 A dep ® e 2 . (31) 

5. Examples 

This section shows various examples to illustrate how the theory specializes to several previous 



optimal control system (Section 5.4) 



works on the subject (Sections 5.1-5.3), as well as to illustrate how the reduction decouples the 
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5.1. Lie Poisson Reduction of Optimal Control of Systems on Lie Groups. Consider, as 
a special case, the nonlinear control system |l| on a Lie group G, i.e., M = G, with symmetry 
under the action of G on itself by left translation: 

L g : G — >■ G; h\-t gh 

for any g G G. This case is particularly simple because we do not need a principal connection and 
the reduced system is defined on the Lie algebra g. 

Recall that the associated bundle E/G = M x G R d is a bundle over M/G; however, M = G 
here, and so its base space becomes G/G, i.e., a point; hence E/G = R d = {u} and the map Jm/g 
becomes immaterial here. On the other hand, the quotient TM/G becomes TG/G = g. Therefore, 
we have f s : R d — > g and the control system reduces to 

C(t) = f a (u(t))- 

where £ := T g L g -i(g). 

In particular, consider the affine control system ^ on the Lie group G. The invariance of 
Xq, i.e., Eq. Q, implies that there exists an element Co £ such that Xo(g) = T e L ff (Co) for any 
g G G, where e G G is the identity. Likewise, the invariance of the distribution T> C TG, i.e., 
Eq. ([8]), implies that there exists a subspace t) in the Lie algebra g of G such that V g = T e L g (D) 
for any g G G; so there exists a basis {Q}f =1 for d such that Xi(g) = T e L g (Q) for any g G G and 
i = 1, . . . , d. Therefore, Eq. Q implies that the matrix R(h,g) becomes the d x d identity matrix 
for any h,g G G. So the corresponding action ^ g : G x W 1 — > G x M. d becomes trivial on the second 
slot: 

V g :{h,u)^(gh,u). (32) 

Hence the quotient E/G becomes 

E/G = (G x R d )/G = (G/G) xR d ^R d = {u}, (33) 
whereas we have TG/G = g. Now, since / : G x R d TG takes the form 



fid, u) = T e L g ^Co + ' 



we obtain the map f g : R d -»• defined by 

d 

/ 8 («) :=Co + 5>iC»- (34) 
i=i 

Therefore, we have the following reduced control system in the Lie algebra g: 

d 

z{t) = (o + J2 u ^^ 
i=i 

This is the case considered by Krishnaprasad [21] (see also Sachkov [391 Section 3]). 

Now, assume that the cost function C : E — )■ M is also G-invariant, i.e., C o ^ = C for any 



h E G; then Eq. (32) implies that, for any g G G, we have C(g,u) = C(e,u) = C(u), where C is 



defined on £/G = R d (recall Eq. <j33j)). 

In this case, the quotient M/G becomes a point and thus the bundle T(M /G)®g becomes just g; 
as a result, £ is equal to £. Notice also that, since the momentum map is given by J(A S ) = T*R g (X g ), 
we have 

A = [5, J(A fl )] G = [e, Ad* J(A,)] G - Ad* J(A S ) = T e *L 9 (A 9 ) G g*, 
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which is the "body angular momentum." Therefore, the Hamilton-Poincare equations (20) reduce 
to the Lie-Poisson equation [9]: 

dti dp. * _ 



So Eq. (22) becomes 



This system with an afhne control, Eq. (34), and the cost function of the form 

d 

C{g,u) = C{u) = -Y,huj 

8=1 

is the case considered by Krishnaprasad [21] (see also Koon and Marsden [191 Section 5.3] and 
Sachkov [39, Section 7]). 

5.2. Clebsch Optimal Control Problem. Consider the following control system defined by a 
group action: Let M be a manifold and G a Lie group, and suppose that a d-dimensional Lie group 
G acts on the manifold M; hence we have the infinitesimal generator um £ X(M) for any element 
u in the Lie algebra g. Now consider the control system ([I]) with / : M X g — > TM defined by 

f(x,u) = u M (x), (35) 

where the element u in g is seen as the control here (note that g = M. d as a vector space) . This is a 
control system associated with the Clebsch optimal control problem (see Cotter and Holm |10| and 
Gay-Balmaz and Ratiu |13j). 

This problem provides a g ood example where the action a : G x M. d — > M. d to the control space 
M. d is non-trivial (see Remark 2.1 ). We define an action of G on E = M x g as follows: 

: M x g -)■ M x g; (x, u) (->■ (<J> s (x), Ad 5 u) . 

Then the equivariance of the infinitesimal generator (see, e.g., Abraham and Marsden [H Propo- 
sition 4.1.26]), i.e., (Adgu)M(gx) = T x Q> g (uM(x)), gives the equivariance of /, i.e., Eq. Now 
E/G = M Xg g ='■ Q, and so we have Jm/g '■ — * T(M/G) and fg : g — > g defined by 

Im/gO?, u ]g) = T x tt(um(x)) = 0, /g([x, u]g) = [x, Ab(«m (»))]c? = [x, u]g- (36) 

Then the reduced system becomes 

x = 0, £ s = [x,A x (x)] G = [x,u]g- 

Hence the point x in the base space M/G is fixed, and so the system evolves only in the vertical 
direction, as one can easily see from Eq. (35). Therefore, the system is further reduced to 

A x (x) = u. 

Given a cost function l:Mxg->M such that £(x, u) = £(u), consider the problem of minimizing 
the integral 

rT 

l{u(t))dt 

o 



subject to Eq. (35), x(0) = xq, and x(T) = xt; where xo and xt are fixed points in M. 
It is easy to see that the optimal control is given by 

di dl 
¥ c H(\ x ,u x (X x ))=J(X x )-—(u x (Xx)) = J(\ x ) = —( u *(\ x )), (37) 



assuming this uniquely defines u x (X x ) [13 
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Now, from Eq. @, P M/G ([X X ] G ) = and /^[AJg) = [x,u*(X x )] G . Therefore, Eq. Q gives 

1 = 0, |=[x,u*(Ax)] G , 
DX dH Dji 



Dt~ dx : 

The second equation gives 

[x,£]g = [x,u*(X x )] G 
and the fourth gives, writing fi = [x,/j]g, 
D 



Dt 



Z = u*(X x ), 



Dt 



[x,n] 



G 



ad 



/i = ad| fi. 



since the curve x(t) is vertical, i.e., ir(x(t)) = x is fixed. However, recall that /i = [x,^]g := 
[x,J(X x )]g and thus [i = J(X X ); then, substituting Eq. (37) into the above equation, we obtain the 
following Euler-Poincare equation: 

d di di 
--K(A,)) = ad;. ( ^-K(A,)). 

This is essentially Theorem 2.2 of Gay-Balmaz and Ratiu |13j . 



5.3. Kinematic Optimal Control — Purely Kinematic Case. As shown in Sections 4.4 



our 



construction of principal connection is explicit for affine and kinematic sub-Riemannian optimal 
control problems. For the purely kinematic case as in Example 4.1, our result recovers that of |27| : 

Example 5.1 (Wong's equations [27}I45|: see also [7J Chapter 4]). For the kinematic sub-Riemannian 
optimal control problems (see Montgomery [291 EH EH [32j [33] and Bloch [H Section 7.4]), we have 



f(x,u) = J2u a X a (x) 



a=l 



and, given a G-invariant sub-Riemannian metric g on M that is positive-definite on the distribution 
T> := span{A"i, . . . , X^}, the cost function is defined as 

1 



C(x, u) 



a S 

-g a pu a u p , 



where g aj3 := g{X a , Xp). 

Assume that the distribution D is G-invariant and also defines a principal connection form A on 
the principal bundle ir : M — > M/G; this is the "purely kinematic" case from Section 5.3 In this 
case, f(x,u) takes values in V; hence A(f(x,u)) = and thus fg([x,u]G) = 0. Therefore, Eq. (23) 
gives 



X, 



Assume that we can write 



dH 



K = -hC b da A d a x a 



f^ /G (x,u) = u a . 

Then the optimal control u* is given by u* ,a = g a ^Xp, and so the reduced optimal Hamiltonian (21 ) 
is given by 

H(x,X) = lg aP X a Xa, 
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where g al3 is the inverse of g a p. Therefore, we obtain x a = g a @ Xp and £ a = coupled with Wong's 
equations: 



X a 



2 dx a 

5.4. Kinematic Optimal Control — Non-Purely Kinematic Case. This is the case of main 
interest in this paper. Since it is non-purely kinematic, the distribution T> does not define the 
principal connection, and hence we need to first find the principal connection. We focus on the 
Abelian case here, because, as mentioned in Remark 3.3, the reduced optimal control system is 
particularly simple if the symmetry group G is Abelian. The following kinematic optimal control 
problem illustrates it (recall that the principal connection is found in Example 4.8): 

control 

^2 = \l>i 



Example 5.2 (Snakeboard: Example 4.8). The optimal control u*, Eq. (27), is given by 

tan d> 



Ai cos 9 + Ao sin 9 — Xg 



and then the optimal Hamiltonian is 

1 



H(x,X) 



Ai cos 9 + A2 sin 9 — \ 



which gives the optimal control system 
cos " 







r\\ cos 9 + rA2 sin 9 — Xg tan 

■ tan 9 

r 

A 2 



tan ( 



sin l 



x 2 



(rAi cos 9 + r\ 2 sin 9 — Xg tan i 



(r Ai cos 9 + r A 2 sin 9 — Xg tan < 
Ai sm9 — A2 cos( 



■4> = K 



(j) = A„ 



0. 



Xe 



(rX\ cos 9 + r A 2 sin 9 — Xg tan < 



A 



0. 



A, 



Xg sec 



(rAi cos 9 + r A2 sin 9 — Xg tan 1 



(38) 

Let us perform the reduction. Introducing A E T*{M/G), £ G g, and /I £ g* defined by (see 
Eq. ( 30 ) for the expression of the connection form .A) 

<V,<£) = (^0, <V) := hi* (As) =(Ae - Ai r cot cos 9 - A 2 rcot (/>sin<9, A^) , 

i(6,<f>) = (li>l2,lv) := k)^(^)]G = (ii - (rcot</> cos9)9, x 2 - (rcot^ sin#)0, ^ , 

A(0,4>) =(AljM2)A^) := [ x ! J(^s;)]g =(Ai, A 2 ,A^,) , 
the reduced optimal Hamiltonian (21) is written as 



H[x,X,fi) 



1 / A« tan 2 1 



+ A^ + 



As a result, the reduced optimal control system (24) gives (see Eq. (31) for the expressions of the 
curvature B) 



tarn 



Xg, 4> 



X<h, £1 — 0, £2 — 0, £w, — /L/,, 
Ae = A^ r esc 2 ^ (/ii cos 9 + f2 2 sin 0) , A<^ = — Ag sec 2 ^ (A# tan <f> + fixr cos 9 + \li r sin 9) 

Ai = °, #2 = 0, £ty = 0. 



This system is significantly simpler than the original optimal control system (38): Notice that we 
now have a decoupled subsystem for the variables (9, cj>, Xg, A^); so we may first solve the subsystem 
and then obtain the dynamics for (x,y,ip) by quadrature (see Remark 3.3). 
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6. Conclusion 

We introduced the idea of symmetry reduction and the related geometric tools in Hamiltonian 
mechanics to nonlinear optimal control systems to define reduced optimal control problems. Our 
main focus was on affine and kinematic optimal control problems. Particularly, we identified a 
natural choice of principal connection in such problems to perform the reduction explicitly. The 
principal connection provides a way to decouple the control system into subsystems, and also, com- 
bined with a Poisson reduction to the Pontryagin maximum principle, decouples the corresponding 
optimal control system into subsystems as well. The resulting reduced optimal control system 
is shown to specialize to some previous works. We also illustrated, through a simple kinematic 
optimal control problem, how the reduction simplifies the optimal control system. 
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Appendix A. Proof of Proposition 14.21 

Lemma A.l. If the Hamiltonian H : T*M — >• M is G -invariant, then the distribution % C TM is 
G-invariant as well, i.e., T^ g {T-L) = % for any g £ G. 

Proof. Let us first show that FH : T*M TM is equivariant, i.e., T$ g o FH = FH o T*$ g -i. For 
any a x G T*M and f3 gx G T gx M, we have, using the G-invariance of H, 



-H(a x + eT*%{p x )) 



(!3 x ,T%oFH(a x )) = (T*$ g (p x ),FH(a x )) 

d 
de 

= ^H(T*<S> g -i(a x )+e(3 x ) 
= (p x ,FHoT*$ g -i(a x )}. 



e=0 



£=0 



On the other hand, J (0) C T*M is G-invariant: Let a x G J (0); then, for any g G G and £ G g, 

(jorVK),^} = (T*%-,(a x ),( M (gx)) 
= (a x ,T<& g -i ■ £m(9x)) 
= (a x , (Ad ff -i Om(x)) 
= (J(a x ),Ad g -it) 
= 0, 

which implies T*Q g -i(a x ) G J -1 (0); thus we have T*& g -i (j -1 (0)) C J -1 (0). This in turn implies 
the other inclusion: For if a x G J _1 (0) then 



a x = T<S> g -ioT$ g (a x )eT$ g -i(J- 1 (0)), 
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because TQ g (a x ) G T*$ 9 (J 1 (0)) C J 1 (0) from what we have just shown. As a result, we have 
T*^-!^- 1 ^)) = J _1 (0), and thus 

T<5> g {U) = T$ g o FF(J _1 (0)) 

= FH oT*$ g -i(J _1 (0)) 
= F#(.T 1 (0)) 

= U. □ 



Proof of Proposition 4-2 Since $ is a free action, any element in g* is a regular value (see, e.g., 
Marsden et al. |26} Section 1.1]). Therefore, J -1 (0)nT*M defines a subspace of T*M of codimension 
dim G, because J : T*M — > q* is linear in the fiber variables of T*M. Since FH is assumed to be non- 
degenerate on J _1 (0), H := Fi2"(j _1 (0)) defines a subspace of T X M of codimension dimG for each 
x £ M, whereas dim V x = dimC Therefore, the assumption H x n V x = implies T X M = T~L X © V : 



x ■ 



By the above lemma, % is G-invariant, and thus defines a principal connection. □ 
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